The Sorting Buffer Problem is NP-hard 



Ho-Leung Chan* Nicole Megow^ Rob van Stee* Rene Sitters* 

September 23, 2010 

Abstract 

We consider the offline sorting buffer problem. The input is a sequence of items 
of different types. All items must be processed one by one by a server. The server is 
equipped with a random-access buffer of limited capacity which can be used to rear- 
range items. The problem is to design a scheduling strategy that decides upon the order 
in which items from the buffer are sent to the server. Each type change incurs unit cost, 
and thus, the cost minimizing objective is to minimize the total number of type changes 
for serving the entire sequence. This problem is motivated by various applications in 
manufacturing processes and computer science, and it has attracted significant atten- 
tion in the last few years. The main focus has been on online competitive algorithms. 
Surprisingly little is known on the basic offline problem. 

In this paper, we show that the sorting buffer problem with uniform cost is NP- 
hard and, thus, close one of the most fundamental questions for the offline problem. 
On the positive side, we give an (9(l)-approximation algorithm when the scheduler is 
given a buffer only slightly larger than double the original size. We also give a dynamic 
programming algorithm for the special case of buffer size two that solves the problem 
exactly in linear time, improving on the standard DP which runs in cubic time. 

1 Introduction 

The sorting buffer problem results from the following scenario. The input is a sequence a 
of n items of different types. W.l.o.g., we represent different types by different colors, i.e., 
each item i is associated with a color c(i). The total number of distinct colors in the sequence 
is denoted by C. All items must be processed by a server. The server is equipped with a 
random-access buffer of limited capacity which can be used to rearrange the items. The 
items are moved one after another into the buffer that can hold at most k items. At any step, 
a scheduling algorithm chooses a color, say c, and then all items in the buffer associated with 
color c are removed from the buffer and processed by the server. This creates space in the 
buffer, and the next items in the sequence will be moved to the buffer. If some of these new 
items have color c, they will be removed and processed immediately and it continues until no 
item in the buffer is associated with color c. The scheduling algorithm then chooses a new 
color and repeats, until all items in the sequence are removed for processing. The goal is to 
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design a scheduling algorithm that minimizes the total number of color changes. The buffer 
has no color initially. 

While the sorting buffer problem looks simple, it models a number of important problems 
in manufacturing processes, hardware design, computer graphics, file servers and informa- 
tion retrieval. For example, consider the sequencing problem in an automotive paint shop (9j, 
where cars are painted in different colors. The cars traverse this production stage consecu- 
tively, and whenever a color change is necessary, this causes setup and cleaning cost. The 
goal is to minimize the total cost for changing colors. For an extended discussion on various 
applications and more references, we refer the readers to, e.g., ||3]|6 12 1. 



1.1 Previous work 

The sorting buffer problem (also known as buffer reordering problem) has attracted signif- 
icant attention since it was first proposed by Racke, Sohler, and Westermann |12j. The 
original focus was on competitive analysis of online algorithms. Racke et al. fT2| proposed 
an 0(log 2 k) -competitive algorithm and showed that some simple heuristics like First In First 
Out (FIFO) and Least Recently Used (LRU) are n(\/^)-competitive, where k > is the buffer 
size. Englert and Westermann f7J improved these results and gave an O (log k) -competitive 
algorithm for a more general non-uniform cost function, where the cost of a color change 
depends on the final color. To obtain this result, they first relate their algorithm's solution 
to an optimal offline solution using a buffer of size it/4. Then, they prove that an offline 
optimum with buffer size k/4 is O (log k) -competitive against an offline optimum with buffer 
size k. The first result translates into a constant competitiveness result using resource aug- 
mentation, i.e., their algorithm is 4-competitive when given a buffer with size 4 times the 
original size. The currently best known result was derived very recently by Avigdor-Elgrabli 
and Rabani BJ; they gave an O (log kj log log/:) -competitive deterministic online algorithm 
for the sorting buffer problem with non-uniform costs. 

Considerable work has been done for the problem when the cost function is a metric and 
the cost of a color change depends on both the original and final colors. We do not review 
the results here and refer the readers to the summary in [ 3] . 

In order to develop good online methods, one of the most natural steps is to investigate 
the offline sorting buffer problem and identify its structural properties. Even if the offline 
problem is less relevant in practice, its analysis should be easier and give new insight to the 
problem. However, only little is known on the offline problem. It is easy to see that there are 
dynamic programming algorithms that solve the problem optimally in 0{n +l ) or 0(n c+l ) 
time; see also [TUJTTJ. Aiming at polynomial time algorithms, the above mentioned online 



algorithms already provide the best known offline approximation guarantees (which are non- 
constant). A constant approximate algorithm for the offline case on the line metric has been 
derived by Khandekar and Pandit fTO| ; however, it runs in quasi -polynomial time. 

There has been research on the complementary variant of the sorting buffer problem, 
where the objective is to maximize the number of avoided color changes in the input stream. 
This problem is more successful in terms of approximation algorithms. Kohrt and Pruhs (TTJ 
gave a polynomial time 20-approximate algorithm. This was later improved by Bar- Yehuda 
and Laserson Q who gave a 9-approximation algorithm for non-uniform cost. Note that the 
maximization and minimization problems have the same optimal solution, but they may be 
very different in terms of approximation. 



2 



1.2 Our results 



We give a concise NP-hardness proof for the sorting buffer problem by a reduction from 
3-Partition [8], and hence close one of the fundamental open questions on this problem (3). 
Clearly, this implies that both variants, the minimization and the maximization problem, are 
NP-hard. Independently, an NP-hardness proof was given by Asahiro et al. (TJ. However, 
their proof is much longer than ours and turned out to be incorrect. Recently, they gave a 
new, though still very long, proof (2j. 

We also note, that increasing the number of servers does not make the problem easier. 
The idea of modeling more servers leads to an intuitive generalization of (or joint model 
for) the sorting buffer problem and the somewhat related well-known paging problem. In 
the latter problem, there is given a cache of m colors while a request from an online request 
sequence must be served immediately without intermediate buffering. We could interpret the 
cache as m servers that may immediately serve a current request. This leads to a generalized 
sorting buffer problem in which we have a buffer of size k and m servers. In this general 
formulation, the sorting buffer problem corresponds to the special case with m = 1, while 
the paging problem has k=l. Yet, the earlier problem is NP-hard, as we show in this paper, 
while the later problem is polynomial-time solvable |5J . 

Naturally, we also consider an immediate adaption of the optimal paging algorithm J5J 
Longest Forward Distance (LFD) as a candidate for the sorting buffer problem. However, 
we show that it is il(^ 1 / 3 )-approximate, hence a different strategy is needed to derive con- 
stant approximate algorithms for sorting buffers. This negative result is in line with similar 
observations for several other natural (online) strategies in Racke et al. fl2| . 

On the positive side, we give an Oin log C) -time optimal algorithm for the special case 
in which the size of the buffer is k = 2. The algorithm uses a somewhat special dynamic 
programming approach with a non-trivial combination of data structures that guarantee the 
linear running time in the input size. Note that it is straightforward to obtain a dynamic 
programming algorithm that runs in 0(n k+l ) time; our algorithm improves this significantly. 

Finally, we consider the setting with resource augmentation, where the algorithm is given 
a larger buffer than the optimal one. We give a new LP formulation for the sorting buffer 
problem and show that it can be rounded using a larger buffer size. This gives an 0(1 /e)- 
approximate algorithm using a buffer of size (2 + e) times that of optimal. 

Organization. In Section |2j we show that the sorting buffer problem and its generaliza- 
tion are NP-hard. In Section [3] we provide the LP and the constant factor approximation 
algorithm using a larger buffer size, and finally we give in Section [4] the dynamic program- 
ming algorithm for k = 2. We present the lower bound on the approximation ratio of LFD in 
Section|5] We conclude with some open problems in Section[6j 

2 Complexity 

Theorem 1 The sorting buffer problem is strongly NP-hard. 

Proof We reduce from 3-Partition which is known to be strongly NP-hard [ 8 ] : Given 3q 
positive integers a\ , az, . . . , aj, q and an integer A such that a \ + a2 + . . • + a^ q = qA can we 
partition {1,2,..., 3q} into q sets /; such that aj = A for all i € {1,2,..., q}? 

Given an instance of 3-Partition, we construct an instance a for the sorting buffer prob- 
lem as follows. We multiply all numbers by a large number L. Let bj = Laj for all j 
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and B = LA. We define the buffer size as V = qB + e, with q 2 A < e < L/2. We see 
the buffer as having a main part of capacity qB and an extra part of capacity e. For each 
j G {1,2,. . .,3q} we define a color j. We call these the primary colors. The sequence con- 
tains many other colors but we we do not label those explicitly. We call these the secondary 
colors. The input sequence is defined by 3q + 4 subsequences: 

a = JS YiSm y 2 b\a 2 . . . y q 8 q a q y^S^a^. 

The subsequences are defined as follows. 

j3 contains bj items of color j for each j G {1,2, .. . ,3q}. Items are given in arbitrary 
order. Note that the total number of items equals is qB which is the size of the main 
part of the buffer. 

a, (i=l. . .q+l) contains aj items of color j for each j G {1,2,.. .,3q}. Again, the order 
is arbitrary. 

Yi (i=l. . . q+l) We distinguish between i < q and i = q+l. For i < q it starts with iB 
items of different colors followed by again one item of each of these colors. Any color 
used in Yi is unique in the sense that it appears twice in Yi and nowhere else in the 
sequence a. Sequence y q+ \ is defined exactly the same but the number of colors is 
now two times V — M, where M = \q(q + 1)A. 

8i (i=\...q+\) contains V items of the same color. This color is not used anywhere else 
in a. 

Let us, just for clarity, count the number of colors in a. The subsequences /3 and a, contain 
the 3q primary colors. The sequences 5, together contain q + l colors. A sequence % con- 
tains iB colors for i < q and V — M colors for i = q + 1 (each color twice). The total number 
of colors in the sequence a is C = 3q + q+ 1 + (Ef=i iB) + V —M. 
We list some properties that any optimal solution Opt has. 

Lemma 2 Before the first item of (Xi enters the buffer, OPT must have used the color of 8i. 

Proof The length of 5,- is equal to the buffer size. □ 
We remark that the reduction would be valid without the subsequences 8\ , . . . , 8 q . However, 
these subsequences gives us separations of the server sequence which enhance the analysis. 
Let us call the moment that the server switches to color <5 ( simply as time 5,. 

Lemma 3 We may assume that Opt serves Yi completely before time 5,. 

Proof Since the items of /; that remain in the buffer when Opt switches to color 5; cannot 
be combined with items arriving later, we may as well serve these remaining items before 
switching to 5,-. □ 
It follows from the preceding two lemmas that we may assume that 

Lemma 4 Opt serves the sequences Yi and 5, in the order Yi S\ Y282 ■ ■ ■ Yg+i &qU- 

Assume that a can be served with at most C + 3q color switches. We shall prove that a 
3-Partition exists. For 1 < i < q, let Hi be the set of primary colors used before time <5, . 
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Lemma 5 



^2 cij > iA for all i S {1,2, ... ,q}. (1) 

yeffi 

Proof Assume that Y*jeHi a j < *A — 1 for some i. Then, Lye//, ^/ < £(iA — 1 ) = iB — L. This 
means that from the gfi items of j3 at most iB — L items are removed before time 5j. Hence, 
the free space we have to serve Yi is certainly no more than iB — L + e < iB — L/2. But then 
at least L/2 colors of /; must be used more than once. The total number of color switches 
will be at least C + L/2 > C + 3q (for q > 2). □ 

Lemma 6 Every primary color is used exactly two times: once before time 8 q and once after 
time 8q+i. Every secondary color is used exactly once. 

Proof Taking i = q in Lemma [5] we see that all 3q primary colors must be used before time 
8 q . Further, each primary color must also be chosen at least once after switching to 8 q+ i 
since cc q+ \ contains all primary colors and is served after the switch to 8 q+ \. We see that the 
bound of C + 3q can only be reached if the statement in the lemma holds. □ 
Let h = Hi and /, = Hi for 2 < i < q, i.e., the set of primary colors used be- 

tween and 5,. Consider the buffer contents at time 8 q+ \. If j E /, then the buffer contains 
at least (q + 1 — i)aj items of color j. The total number of primary colored items in the buffer 
is at least 

££(?-i + l)«j = £ £a ; ->£A = ^+l)A=M. (2) 

i=l jeh i=l jeHi i=l z 

If ([2]) holds with strict inequality, then at least one color of y q+ \ is used twice which contra- 
dicts Lemma[6] Hence, equality holds and this can only be true if equality in ([T]) holds for all 
i. This implies that Lye/, a j = A for all i. Hence, a 3-Partition exists. 

The other direction is easily verified. Assume that a 3-Partition I\ , . . . ,I q exist. Then we 
have equality in ([T]). The server takes the colors in /, just before Note that all items from 
the a,'s fit together in the extra part of the buffer. Hence, the free space in the buffer just 
before Yi is at least iB for all i < q and jj can be served such that each color is used only once. 
All primary colors are used exactly twice. □ 

The NP-hardness of buffer sorting extends to the generalized sorting buffer problem with 
multiple servers m, even if m is constant. 

Theorem 7 The generalized sorting buffer problem is NP-hardfor any number of servers m > 
1. 

Proof The idea is to show that the problem with m servers and buffer size V can be reduced 
to the problem with m + 1 servers and the same buffer size. The theorem then follows by 
induction with the induction basis (case m = 1 is NP-hard) given in Theorem [T] 

Assume m = t is NP-hard for some integer I >= 1. Consider an arbitrary sequence p for 
the case m = i. We take a color x not used in p and add k items of color x between any two 
consecutive items in p. Let the resulting sequence be p'. We claim that it is optimal to p' to 
let one server serve only color x and the other ones the remaining colors. Suppose this were 
true, then p can be served using I servers with minimum cost z if and only if p' can be served 
using £ + 1 servers with minimum cost z+ 1. Hence, the case for m = £ + 1 is also NP-hard. 



5 



It is left to prove the claim. For the sake of contradiction, suppose there is an optimal 
solution Opt to sequence p' which does not serve all items of color x by the same server. 
Let m\ be the server which serves the first item of color x in p'. Consider the first moment 
in which an item preempts the sequence of consecutively serving color x by nt\, i.e., an 
item i of color c(i) ^ x is assigned to m\. Let S be the set of items that are in the buffer 
at that moment. We can assume that the next item j that enters the buffer is the first of k 
consecutively incoming color-x items. (Whenever we remove one color-x item from the 
buffer, then we can serve all of them without extra cost.) Hence, with the buffer capacity k, 
Opt must serve at least one (and thus w.l.o.g. all) items of color x before an new item with 
color different from x can enter the buffer. 

Consider the schedule after Opt served the color x items by some server, say mi. Sup- 
pose m\ 7^ mi. While the current color of mi is x, server mi might have served after i some 
items of the same or other colors from S; let i' be the last item assigned to mi so far. Now, 
we simply exchange the current output sequence on server mi from item i up to with the 
sequence of color-x items on mi. This is feasible since we only swap output positions of 
items in S that are in the buffer or enter with the same color x. Note, that the currently active 
colors of the servers are not changed. Moreover, the cost of the schedule can only decrease: 
Moving the color-x items to mi reduces the cost by one and moving the sequence starting 
with item i to mi does not cause a new color change. Thus, Opt was not an optimal solution. 

If mi = mi, then we extract from the output sequence on mi the subsequence i up to i', 
and assign it to the end of the current sequence of some server, say mi. Clearly, the current 
color of mi changes and may cause an additional unit of cost when OPTassigns the next item 
to mi. However, we reduce the cost by one unit when removing the color change on mi for 
switching back to color x. Thus, the cost do not increase. This exchange can be applied 
iteratively to an optimal solution until no items of a color different from x is assigned to mi. 
□ 

3 Resource Augmentation 

In this section, we give an LP-based algorithm which yields an 0(1/ e)- approximation with 
respect to the optimal solution that uses no more than 1/2 — 2e times the original buffer 
size. By scaling up the buffer size by a factor of 2 + 0(e), it gives an 0(1/ e)- approximate 
algorithm using a buffer size of 2 + e times that of optimal. 

We first introduce a new LP relaxation, followed by a rounding scheme. We consider 
that the buffer is empty initially. For each time step i = 1 , 2, . . . , n, the following three events 
occur. (1) The i-th item is moved to the buffer, (2) the algorithm chooses c(i) to be the color 
of the buffer, and (3) all items in the buffer with color c(i) are removed. Call an interval a 
c-interval if the color of the buffer is c throughout the interval and call it non-c if the color 
is not c throughout the interval. The cost for serving a color c is the number of maximal 
c-intervals. Note that the cost over all colors is exactly 2 — C plus the number of maximal 
non-c intervals. One observation is that after each time step i = 1,2, ... , n, the number of 
items in the buffer should be at most k — 1 . It motivates the following IP. 

We define a variable y c s t for every color c and time steps s, t with 1 < s < t < n. y c s t should 
be one if [s,t] is a maximal non-c interval; and it is zero otherwise. For each color c and time 
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step s < i, let A c s i be the number of items with color c moved into the buffer during [s, i] 



minimize 2 — C + ^^y c st 

c s,t: 
s<t 

subject to £ Xv t — 1 

s,t: 

s<t; s<i+l; i<t 

c s,t: 
s<i<t 



.V<1<« 



y«e{o,i} 



for all c and i = 1, 2, . . . , n +k — 1 (3) 



forall i = l,2,...,n+£-l (4) 



for all i = 1,2, ... ,n — 1 (5) 



for all c and i = n + k — 1 (6) 



for all c and s,fe{l,2,...,n+£ — 1}. (7) 



The first constraint ((3]) ensures two things: (i) for any color c and time i, i is included in at 
most one maximal non-c interval and (ii) maximal non-c intervals are really maximal, i.e. if 
y c s t = v = 1 then £<w + 2orv<5 , + 2. By (i), each color c contributes at most 1 to the left 
hand side of the second constraint ([4]). Hence this constraint ensures that at any time i, the 
color of the buffer is different from exactly C — 1 colors. Constraint ([5]) ensures that by the 
end of each time step i < n — 1, the number of items remaining in the buffer is at most k — 1 
and constraint (|6]) ensures that the buffer is empty at the end. It is easy to verify that for any 
valid schedule we can set the values of y c s t according to whether it is a maximal non-c interval 
and this satisfies all the constraints. Reversely, any IP-solution corresponds with a feasible 
coloring sequence with the same cost. The LP-relaxation is obtained by replacing Q with 
y c s t ^0- ^ i s eas y to verify that any LP-solution has value at least C. We can round the LP 
to get an 0(1 /^-approximation against an optimal solution that uses no more than 1/2 — 2e 
times the buffer size. Define 



s,t: 
s<i<t 



Intuitively, 1 — x\ is the fraction of color c on the machine at step i. Further, define 

4= £ f s>i , and Zf=£4 
s: j=l 
\<s<i 

The variable zj sums over all intervals ending in i and the variable Zf sums over all intervals 
ending in / or before that. In particular, Z c n is the LP-cost for color c. The value Zf is non- 
decreasing in i. We mark every step that Zf increases by another e. More precisely, mark 
the first step i for which Zf > e and mark every next step i' for which Zf, has increased by at 
least £ since the last marking. 

A feasible integral solution is found by the following rounding scheme. 
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LP Rounding. Start with an arbitrary buffer color. For i = 1 to n + k — 1 do: 

(i) Remove all items with the current color (state) of the buffer. 

(ii) For each marked color c, remove all its items. 

(iii) If xf < 1/2 — £ for some d, then switch the color to d and remove all items with 
color d. 



Theorem 8 The LP Rounding Algorithm applied to an optimal LP solution yields an 0(1 /e)- 
approximate solution for the sorting buffer problem when the optimum is using a buffer of 
size at most 1/2 — 2e times the original buffer size k. 

Proof First we argue that (iii) is well defined. Constraint Q states that £ c jc? > C— 1 and ([3]) 
states < 1. Hence, there is at most one d for which x? < 1/2 — e. (*) 

The first step (i) is done for free, and one can easily verify that only the just entered item 
is possibly removed in this step. Clearly, the number of markings is 0(1 /e) times the LP 
cost. Consider two consecutive switches. If at least one of the two is due to a marking then 
we charge both to the marking. To prove that the total number of switches is 0( 1 /e) times the 
LP cost we only need to bound the number of pairs of consecutive switches in which both are 
of type (iii). Assume the buffer switches to d in step i and subsequently switches to another 
color c" in step j > i and both are of type (iii). We have x\ < 1/2 — e and x c j < 1/2 — e. The 
first implies that xf > 1/2 + e; see (*). Hence, x c - —xf < —2e. 

Notice that for every j > i and c holds that 

*W=L^-L^= I ylt- I f„>o- £ f„ 

s,f. s,t: s,t: s,t: s,t: 

«<j<t s<i<t i+l<s<j<t s<i<t<j—\ s<i<t<j—\ 

= -(zy x -zu). 

Therefore, 2e < xf —x c - < Z c -_ x —Zf_ x . Thus, for color c" there is an increase of the Z- 
variable of 2e between two switches of the third type. We conclude that the total cost due to 
switches of the third type is also 0(1/ e) times the LP cost. 

Now we bound the capacity needed. Consider any c and step j and let i < j be the last 
time before j that c was removed from the buffer in the rounded solution. We may assume 
that c was not removed at step j since otherwise there are no items of color c at the end of 
step j. Denote the term for color c in constraint (|5]) by a c j. 

"'i L 

s.t: 

s<j<t 

Intuitively, a c - is the amount of color c in the buffer at step j in the LP-solution. On the other 
hand, the number of items of color c in the buffer at step j in the rounded solution is A c i+l •. 
To relate the rounded solution to the LP-solution we are interested in the variables that corre- 
spond to (s, t) -intervals with s < i + 1 < j < t. For these (s, ?)-intervals we have A c s t > A c i+l .. 
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Since we have not picked color c in steps i + 1 , . . . , j, we have Z c - —Zf < s. Note further 
that Y,s,t:s<i+i<j<ty C st — -4+1 ~Z C j-i +Zf . Since c is not removed at step i + lwe have x c i+l > 
1/2 — s. Using additionally Z^ j < Zj we conclude that 

I y c s , t >^-e-Z c j_ 1+ Zf >i-e-z c j+ Zf>^-2e. 

s,t: L 
s<i+l<j<t 

Finally, we can relate the amount of c in the LP-buffer with the number of c in the buffer of 
the rounded solution. 

a ) = I </v;, > I K^, > Af +1J I tf, > Af +1 j (\ - 2s) . 

s.t: s,t: s,t: \^ / 

s<j<t s<i+l<j<t s<i+l<j<t 

Hence, the total number of items in the buffer after step j is A c i+i j < a c j/( 1 /2 — 2e) < (k — 
l)/(l/2 — 2s). Moreover, when j = n + k — 1, we have dj = by constraint ([6]>. This implies 
that the buffer is empty at the end. □ 



4 Dynamic programming 

Straightforward dynamic programming algorithms solve the sorting buffer problem opti- 



mally in running time 0(n k+l ) or 0{n c+l ); see also [10 11 1. In this section we consider 
the special problem setting with a buffer of size k = 2, and give an algorithm with linear 
running time for this special case. This is optimal since the size of the input is 0(«logC). 

Theorem 9 There is an optimal algorithm solving the sorting buffer problem with buffer 
size k = 2 in time 0{n\ogC). 

In our dynamic programming algorithm, we maintain the optimal cost Opt,, a set Si of 
colors, and the sizes of those colors. A color c is in 5,- if there exists an optimal way to serve 
the first i items in the sequence such that an item of color c is served last. The size of a 
color is the (or, a possible) number of items of this color that are served together if this color 
is served last. In order to use only linear time, from one step to the next we only store the 
changes in 5, and in the sizes of the colors. This works because the number of these changes 
is amortized constant per step. 

We can initialize S\ = {c\} and OPTi = 1. The cost Opt,- increases as soon as c\ ^ c\ 
for some i; as long as c\ = C2 = ■ ■ ■ = c,-, we have Opt,- = 1 and size{ci) = i. 

Definition 1 For any step i > 1, let j < i be the most recent step such that cj ^ C{. If there is 
no such step, set j = 0. 

Observation 1 For each i > 1, we have < |5,-_i | + 1. 

The only color that could possibly enter the set of optimal finishing colors is the color of 
the most recent item; any other color would have been optimal before. 

Lemma 10 At any step i, there can be at most one color c such that size(c) > 1; this is 
color Cj. 
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Proof Suppose there is any other color c in 5; with size{c) > 1. Then the last two items in 
some optimal serving order have color c / c,. But then item i is served in step i — 2 or before, 
i.e. before it entered the buffer, a contradiction. □ 
We are now ready to present our dynamic program. As stated, it begins processing as 
soon as a color different from c\ appears in the input. In each step, it determines the current 
set Si and the sizes of all the colors, based on this information of the previous step. 

1 . Before processing the ith item, store the answers to the following questions using S,-_ i 
and the current sizes of colors (i.e., as they are after processing step i — 1): 

(a) Cj€Sj-i? 

(b) a-i€Si-il 

(c) If so, do we have size(ci-\) = 1? 

(d) c,+i G Sj_i? (We may need this information in step i+ 1; hence, we need to 
remember this bit for one step) 

Finally, if c; 7^ c,-_ 1 , set j = i—l and update the bits indicating whether Cj = C[- \ G S,-_ 1 
and Cj G Sy_i = S,_2 (using the answer to (d) that was stored in the previous step; if this 
is the first step that the dynamic program is executed, we have c, ^ S,_2). If Cj = Q-i, 
keep those bits unchanged. 

2. If c ; - G 5,_i, then Opt,- = Opt, 1 and size(cj) increases by 1. If c,- = c,_i, 5; remains 
unchanged. Else, 5, consists of at most two colors: c; and possibly c,-_i. This color c,_i 
is only in 5; if c,_i G S,_i and size(c,_i) = 1. 

3. If c, ^ 5,_i, there are two cases. 

(a) If ct G Sj-u cj G 5,_i and size(cj) = 1, then S, = {cj}, size(cj) = 1, and Opt, = 
Opt,_i. 

(b) Else, add q to 5 ( _i to get 5,-, let size(ct) = 1, and Opt,- = Opt,-_i + 1. If c, G 5,_2, 
c,_i G5,_i and jjze(c,_i) > 2, then we still have c;_i G 5,-, but now with size(c{- 1 ) = 
1. There are no other changes between 5,_i and 5;. 

Lemma 11 TTi/s dynamic program calculates the optimal cost. 

Proof We use induction. For the base case (i = 1), the optimal cost is 1, and Si = {ci}. 
Consider a later step i. For item i, there are only two options: the item must be served last or 
next-to-last (because it does not enter the buffer until i — 2 items have been served). 

The easiest case occurs when c,- G S,-_i : if it was optimal to serve the first i — 1 items and 
finish in color c,-, this is now still optimal, and we can just serve the new item last. (Step 2.) 
However, we still need to determine the other colors in S,. Now, if c; = c,_i, consider any 
color c G Si- i,c/ Cj. By induction, an optimal way to serve the first i — 1 items is to end 
with colors c,-_ 1 , c in this order (item i — 1 cannot be served before step i — 2). It follows that 
we can now still use this order without increasing the cost, since c,- = c,-_ 1 . Hence, we find 
that Si = S,_i. 

If C{ / Ci-i, Ci-\ G S;_i, and size(c,-_i) = 1, then item i — 1 can be served last as well. 
This holds because c, G S,_2 (a color can only enter S, when it is being requested), so an 
optimal way to serve the first i — 1 items is to end with colors c,-,c ; _i in this order, and we 
can still use this order now without extra cost. 

Consider the case c, G" S,-_i. There are two possible reasons for not serving item i last: 
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• Item i can be combined with an earlier item of the same color (but only if item i is not 
served last). To find out whether this is the case, we consider the set 5 7 _i of optimal 
finishing colors just before item j arrived. If c; G Sj-\, then color c,- was dropped 
from the set of finishing colors when item j arrived. In this case we can combine 
items j + 1 , . . . , i (that all have color c,) with a previous item of color c; if and only 
if size(cj) = 1 (because that allows us to keep item j in the buffer while serving color i 
without increasing the total cost for color cj). (Step 3(a).) 

Else, it is optimal to serve item i last, but the optimal cost increases by 1 compared to 
the previous step. 

• Some other item can be combined with a future item of the same color. (This is the 
reason why we keep track of optimal finishing colors.) It can be seen that any color 
in 5,-_i with size 1 can also be delayed for one more step without increasing the cost 
further (note that we are in the case where the optimal cost has increased compared to 
the previous step). A color c in 5,-_i of size at least 2 could be served last by splitting 
it into two parts and keeping one item until the end. But in order for this to be optimal, 
we must have c, G 5,-_2, since we now pay 1 more for color c and hence must serve 
color ci with the same total cost as before, i.e., item i must be served together with a 
previous item of the same color, while also being served in step i — 1 (or i). 

We complete the proof by showing that S; = {cj} in Case 3(a). This is a case in which 
the optimal cost for serving the first i items is the same as it was for serving the first i — 1 
items. In particular, the cost to serve color q did not increase, although c; ^ 5,-_i. Hence, in 
any optimal serving order, item i must be combined with at least one previous item of the 
same color. In particular, item i must be served as the penultimate item (since it cannot be 
served earlier, and if it is served last, we must use a suboptimal way to serve the first i — 1 
items if we want to combine item i with a previous one of the same color). In fact, the items 
served in all steps j — 1 1 must be of color c,-. This is clear if j = i — 1. Else, all items 
following j have color c,, and yet c, ^ 5,-_ i . This can only happen if the items j + are 
served in steps j, — 1, following another item of color c, which is served in step j — I. 
This means that item j, which is not of color i, must be served in step i, thus fixing 5; = {cj} 
(and size(cj) = 1). □ 

Hence, we maintain for each step the optimal cost so far, whether color c; enters the set 
of optimal finishing colors, which colors leave, and which unique color has size more than 1 
(if any). It is a nontrivial task to maintain these things in only linear time, and in particular to 
do this in such a way that an actual optimal solution can be constructed afterwards (and not 
just the optimal cost). We are going to use three objects: 

• An array S of size C <n, where C is the number of different colors. In this array, S[c] 
indicates the (current) size of color c in 5,-. (We assume the colors are given by numbers 
from 1 to C.) Also, with each item S[c] we associate a pointer to c in the list L below. 
(If S[c] = 0, it is a null pointer.) 

• A doubly-linked list L which at all times has size |5,| and contains links from each item 
c G Si to S[c] to indicate which colors are nonzero (we need this in order to efficiently 
remove items from S whenever needed). 

• An array H of size at most In in which we store the entire history of changes in S and 
Opt. For each i, the first number indicates whether Opt, > Opt,-_i. Then, we have 
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a sequence of pairs (color, change), followed by a zero to mark the end of processing 
for this i. 

Regarding the size of H, in each step i at most one color can enter 5, and many may leave. 
However, the latter ones must have entered before. Since each item in the input may cause 
only its color to enter Si (and this happens at most once for each item), and each item may 
cause only preceding colors that entered Sj to leave Si, the total number of these changes is 
at most 2n. Finally, each item i may cause one size of one other color d G S\ to drop to 1 (in 
Step 3(b)); we have at most n/2 such events, since the size of c' must first have increased to 
above 1. 

In total we have at most 5«/2 changes that can be stored in an array of length n + 5n + n = 
In, where for each i we first store the possible change in Opt,-, then use two places for each 
change in S indicating the color and the amount of change (positive or negative), and finally 
a separator bit. 

Note that by doing it in this way, we need to store numbers up to n (the possible decrease 
of a color size in one step), which takes logrc place, for a total space requirement of ralogn. 
However, we could also encode a decrease of d for color c by using d successive entries c. 
Naturally we do need to specify colors, which takes logC bits, so the overall space and time 
requirement can be limited to 0(?ilogC), i.e., linear in the size of the input, which is a list of 
n colors. 

The questions in l(a-d) can be answered in O(logC) time by checking the array S. In 
fact, all operations in the dynamic program take O(logC) or constant time apart from clearing 
the set Si in Step 2 and 3(a) when needed, the cost of which however can be amortized as 
argued above. 

To find an optimal way to serve the sequence, we can finish with any color in the array S 
as it is when step n has been processed. We then search the input for this color, starting from 
the end. As soon as we find it, say at position i, we know that it is optimal to keep item i in 
the buffer in the end, and therefore to serve items i + l,...,n at places i, ... ,n — 1. We can 
then reconstruct Sj_i from S n using the changes that we stored, take any color from S ( -i, and 
repeat. This also takes only linear time, and thus, Theorem[9]follows. 

In Appendix II, we will also give an example to illustrate the algorithm and the storage 
and access of information in the described data structures. 

5 A lower bound for LFD 

The well-known paging problem has several offline algorithms that solve it to optimality. 
One of those is the Longest Forward Distance (LFD) algorithm (5). With the mentioned 
relation to the sorting buffer problem, it is reasonable to consider a natural adaption of this 
algorithm for sorting buffer. In the following we give a negative result that rules out LFD as 
a candidate for a constant approximation algorithm. 



Longest Forward Distance (LFD). 

If no item can be served without a color change, then choose the color of 
item i that has its next occurrence j > i farthest in the future of the sequence. 
If no more items j with the same color as i exist, the distance is infinity. 



Theorem 12 LFD has an approximation ratio of at least Q.(k 1 ^). 
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Proof Consider the following input instance. Given is a buffer of size M + n, where M >n 3 . 
The sequence of items is as follows; we describe each item by its color (natural number), and 
we denote by a h that the item with color a appears b times consecutively. 

[0 M ] 

[ 123. ..n] [2 3 2 4 3 ... n^ 1 ] 
[0123...H-1] [2 3 2 4 3 ... (n 
[0123...H-2] [2 3 2 4 3 ... (n 

[0123] [2 3 2 ] 
[012] [2] 
[01] 

The sequence consists of n + 1 lines; let us denote them as Lq,Li, . . . ,L n . Initially, the buffer 
contains all items of line Lq and the first block (in brackets) of L\. An optimal solution 
chooses color first; it can serve all items of this color and by the end, all remaining items of 
the sequence are in the buffer. Thus, there are no more than n + 1 color changes necessary. 

LFD chooses color 1 first, moving the next item of color 2 into the buffer. Then it picks 2, 
moving two items of color 3 into the buffer and repeats until it chooses n and moves the first 
block of L2 into the buffer. Then the process repeats. This way, LFD causes n — i color 
changes serving the first block of line L,. Thus, it has total cost n(n + l)/2. 

The ratio of LFD's cost and the optimal cost for this sequence are n/2. Hence, LFD has 
an approximation ratio bounded by D.(k 1 ^ 3 ) for a given buffer of size k. □ 

6 Open problems 

Now that NP-hardness has been settled, the main open problem is to design a polynomial 
time constant factor approximation. In the introduction we listed several partial results on 
this. Given our LP-rounding result, a natural next step is to design an algorithm that gives 
an 0(1 /^-approximation against an offline solution using only (1 — e)k capacity, instead of 
(l/2-2e)Jfc. 

We gave a dynamic program for k = 2 which has a significantly better running time than 
the straightforward DP. It would be interesting to give an exact algorithm for with a running 
time that is much less than 0(n k+l ). 

Our NP-completeness proof is not approximation preserving. It remains a question 
whether the buffer sorting problem is APX-hard or not. 
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